Automated Detection and Segmentation of Table of Contents Page from Document Images
نویسندگان
چکیده
With an aim to extract the structural information from the table of contents (TOC) to help develop digital document library the requirement of identifying/segmenting the TOC page is obvious. The objective to create digital document library is to provide a non-labour intensive, cheap and flexible way of storing, representing and managing the paper document in electronic form to facilitate indexing, viewing, printing and extracting the intended portions. Information from the TOC pages be extracted to use in document database for effective retrieval of the required pages. In this paper we present fully auotmatic identification and segmentation of table of contents (TOC) page from scanned document.
منابع مشابه
Persian Printed Document Analysis and Page Segmentation
This paper presents, a hybrid method, low-resolution and high-resolution, for Persian page segmentation. In the low-resolution page segmentation, a pyramidal image structure is constructed for multiscale analysis and segments document image to a set of regions. By high-resolution page segmentation, by connected components analysis, each region is segmented to homogeneous regions and identifyi...
متن کاملA New Algorithm for Skin Lesion Border Detection in Dermoscopy Images
Background: With advances in medical imaging systems, digital dermoscopy has become one of the major imaging modalities in the analysis of skin lesions. Thus, automated segmentation or border detection has a great impact on the subsequent steps of skin cancer computer-aided diagnosis using demoscopy images. Since dermoscopy images suffer from artifacts such as shading and hair, there is a need ...
متن کاملDocument Analysis And Classification Based On Passing Window
In this paper we present Document analysis and classification system to segment and classify contents of Arabic document images. This system includes preprocessing, document segmentation, feature extraction and document classification. A document image is enhanced in the preprocessing by removing noise, binarization, and detecting and correcting image skew. In document segmentation, an algorith...
متن کاملA Semi-Automated Algorithm for Segmentation of the Left Atrial Appendage Landing Zone: Application in Left Atrial Appendage Occlusion Procedures
Background: Mechanical occlusion of the Left atrial appendage (LAA) using a purpose-built device has emerged as an effective prophylactic treatment in patients with atrial fibrillation at risk of stroke and a contraindication for anticoagulation. A crucial step in procedural planning is the choice of the device size. This is currently based on the manual analysis of the “Device Landing Zone” fr...
متن کاملSegmentation of heterogeneous document images : an approach based on machine learning , connected components , and texture analysis
Document page segmentation is one of the most crucial steps in document image analysis. It ideally aims to explain the full structure of any document page, distinguishing text zones, graphics, photographs, halftones, figures, tables, etc. Although to date, there have been made several attempts of achieving correct page segmentation results, there are still many difficulties. The leader of the p...
متن کامل